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DUPLICATE 



DERIVATISED MOLECULES FOR MASS SPECTROMETRY 



All documents cited herein are incorporated by reference in their entirety. 



TECHNICAL FIELD 

This invention relates to derivatised biopolymers and ions obtainable therefrom. The invention 
5 further relates to compounds and solid supports useful for producing the derivatised biopolymers and 
ions of the invention. 

BACKGROUND OF THE INVENTION 

Mass spectrometry is a versatile analytical technique possessing excellent detection range and speed 
of detection with respect to High Performance Liquid Chromatography (HPLC), Gas 
1 0 Chromatography (GC), Infira-Red (IR) and Nuclear Magnetic Resonance (NMR). 

However, many biopolymers, such as carbohydrates and proteins, are difficult to analyse using mass 
spectrometry due to significant difficulties in ionising the biopolymer, even using Matrix Assisted 
Laser Desorption/Ionisation Time Of Flight (MALDI-TOF) techniques. Despite the considerable 
resolving power of 2D-PAGE, this technology has fallen far short of the ultimate goal of displaying 
15 the whole proteome in a single experiment, as many proteins are resistance to 2D-PAGE analysis (e.g 
those with low or high molecular masses, membrane proteins, proteins with ' extreme isoelecte^ 
points, etc.). Many proteins are thus invisible to 2-D PAGE [Cravatt & Sorensen (2000) Current 
Opinion in Chemical Biology vol. 4, p. 663-668]. 

There is thus a need for improvements in mass spectrometry analysis of biopolymers. y. 
20 DISCLOSURE OF THE INVENTION 

It has now been found that covalent attachment of trityl derivatives to biopolymers can improve the 
ionisation properties of the biopolymer. The ions (formula (I) below) formed by ionisation of the 
derivatised biopolymers are particularly suitable for mass spectrometry analysis, and biopolymers 
derivatised as specified in formulae (IHa) and (Hlb) below can be readily ionised. 

25 Whereas triphenylmethyl derivatives covalently attached to certain biopolymers (e.g, DNA) are 
known in the prior art [e.g. Chem. Soc. Rev. (2003) 32, p. 3-13], the prior art attaches the polymer to 
the a-triphenylmethyl carbon atom through a non-aromatic linker. In contrast, under the present 
■ invention the biopolymer is attached to the a-triarylmethyl carbon atom via an aromatic group 
adjacent to the central carbon atom. Consequently, ionisation of the prior art derivatives results in 

30 separation of the triphenylmethyl derivative and the biopolymer, whereas according to the present 
invention the biopolymer remains bound to the trityl derivative on ionisation, thereby allowing 
analysis of the biopolymer by mass spectrometry. 

The invention provides methods of forming ions from covalent or ionic compounds and solid 
substrates. 




i' perivatised Biopolymers 

The invention provides a method of forming an ion of formula ©: 



(Ax 2 ) — C— [Ar 1 — (L M {M'— Bp'} p ) q ] m 



0) 



comprising the steps of: 

(i) reacting a compound of the formula (Ha): 

. (Ar 2 ^— C— [At 1 — (L M {M} p )q] 
X 



lm' 



(Ha); 



10 



with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent 
linkage, to provide a biopolymer derivative of the formula (EUa): 

(Ar 2 ),,— C— [Ar 1 - (L M {M— B P , } p ) q ] m 

X (ma); and 

(ii) cleaving the C— X bond between X and the a-carbon atom of the derivative of 
formula (Etta) to form the ion of formula (I); 



where: 



15 



C* is a carbon atom bearing a single positive charge or a single negative charge; 

X is a group capable of being cleaved from the a-carbon atom to form an ion of formula (I); 

M is independently a group capable of reacting with B P to form the covalent linkage; 

Bp* is independently the biopolymer residue of B P produced on formation of the covalent 



linkage; 



20 



25 



M' is independently the residue of M produced on formation of the covalent linkage; 

Ar 1 is independently an aromatic group or an aromatic group substituted with one or more A; 

Ar 2 is independently an aromatic group or an aromatic group substituted with one or more A; . 

optionally wherein (a) two or Ihxee of the groups Ar 1 and Ar 2 are linked together by 
one or more L 5 , where L 5 is independently a single bond or a linker atom or group; and/or (b) 
two or three of the groups Ar 1 and Ar 2 together form an aromatic group or an aromatic group 
substituted with one or more A; 
A is independently a substituent; 

L M is independently a single bond or a linker atom or group; 
n = 0, 1 or 2 and m = 1, 2, or 3, provided the sum of n+m = 3; 
p independently = 1 or more; and 




) (i) reacting a compound of the formula (Eh): 

(Ar 2 ) n - C- [Ar ! — (L M {M} p ) q ] m 



X * (lib); 
with a biopolymer, Bp, having at least one group capable of reacting with M to form a covalent 
linkage, to provide a biopolymer derivative of the formula (Dlb): 



(ArV- C— [ Ar 1 - (L M {M-— B P '} p ) q ] m 



(IHb); and 



dissociating X* from the derivative of formula (mb), to form the ion of formula (I); 
where: 

X*is a counter-ion to C*; 

and C*, M, B P ', M, Ar 1 , Ar 2 , Lm, n, m, p and q are as defined above. 
10 The compounds of formulae (Ila) or (lib) may optionally be purified after step (i). 
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The invention also provides biopolymer derivatives of the formula (Dla) or (Dlb), as defined above. 
The biopolymer derivatives of the invention have enhanced ionisability with respect to free 
biopolymer, B P . Advantageously, the biopolymer derivatives may not require a matrix (e.g. as used 
in MALDI-MS) in order to elicit ionisation, although a matrix may help to enhance ionisation. 
Preferably, ionisation may be obtained without requiring acid treatment, in particular by ; 4irect laser 
illumination. * 

The invention also provides ions of formula (I), as defined above. These ions are stabilised by the 
resonance effect of the aromatic groups Ar 1 and Ar 2 . Electron-withdrawing groups, when C* is an 
anion, or electron-donating groups, when C* is a cation, may optionally be provided on Ar 1 and/or 
Ar 2 to assist this resonance effect Consequently, the biopolymer derivatives of the invention readily 
form ions of formula (T) relative to the native biopolymer, B P . 

The ions of formula (I) are generally only ever seen on a mass spectrum with a single charge, which 
Is advantageous since it reduces cluttering of the mass spectrum. 

The invention also provides compounds of the formula (Ha) and (lib), as defined above. As 
mentioned above, these compounds are useful for forming ions of formula (I). As the difference in 
the molecular mass of the ions of formula (I) and that of the free biopolymer can be accurately 
calculated, the derivatised compounds of the invention allow analysis of the biopolymer Bp, which 
may be otherwise difficult or impossible to analyse using known mass spectrometrical techniques. 




lion include more uiiiforD^its^of^the^sigaal. ........ 

***** ^^^^J^^^^^^^^t:^ 



( between compounds with different, but close, masses, so that techniques such as isotope coded 
" affinity tagging (ICAT) can be employed with the compounds of the invention.' 

The homogeneous methods of the invention are particularly appropriate for small molecules, e.g. 

amines. 
5 Solid Sitpports 

The ions of formula (I) may also be formed using a derivatised solid support. 
The invention therefore provides a method of forming an ion of formula (T) comprising the steps of: 
(i) reacting a solid support of formula (TVai), (TVaii), or (TV aiii): • 
(Ax 2 )— C— [Ar 1 — (L M {M} p ) q ] m 




S S 



(TVai); 




Ss 



s Ar* (L M {M} p ) q 

(Ar 2 ^— C— [Ar 1 — (L M {M}p) q ] m -i 



10 



X 




S S 



(TVaii); 



^Ax 2 

(Ar 2 ) n .! C— [Ar 1 — (L M {M> p ) q ] m 

X (TVaiii); 

with a biopolymer, B P , having at least one group capable of reacting with M to form a covalent 
linkage, to provide a modified solid support of the formula (Vai), (Vail), or (Vaiii), respectively : 

" (Ar 2 ) n — C— [Ar 1 — (L M {M"— Bp'}p) q ] m 



(Vai)); 




S S 




Ss 
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Ar^— (U M {M -Bp'Vp), 

(ArV- C— [Ax 1 — (L M {M' Bp^qlm-l 

X (Vaii); 



" • ®... , 

At 2 

(Ar 2 )^! C— [Ax 1 — (L M {M'— B P '} p ) q ] m 

X (Vaiii); 

and either: 

(iia) for modified solid supports of formula (Vai) cleaving the C-S s bond between 
the a-carbon atom of the modified solid support of formula (Vai) and the solid support S s to form the 

5 ion of formula (I); 

(iib) for modified solid supports of formula (Vaii), either simultaneously or 
sequentially, cleaving the C-X bond between X and the a-carbon atom and cleaving the S s - - -Ar 1 
bond between the solid support and the Ar 1 group to form the ion of formula (I); or 

(iic) for modified solid supports of formula (Vaiii), either simultaneously or 
10 sequentially, cleaving the C-X bond between X and the a-carbon atom and cleaving the S s - - -Ai 2 

bond between the solid support and the Ar 2 group to form the ion of formula (I); 
where: 

X, Ar 1 , Ar 2 , IV, L M , M, M', n, m, p and q are as defined above; ' r ^ ' 

Ss is a solid support; 
15 C- - -S s comprises a cleavable bond between C and S s ; 

S s - - -Ar 1 comprises a cleavable bond between Ar 1 and Ss; and 

Ss- - -Ar 2 comprises a cleavable bond between Ar 2 and Ss. 
The cleavable bond of C- - -S s , S s - - -Ar 1 or S s - - -Ar 2 may be a covalent, ionic, hydrogen, dipole-dipole 
or van der Waals bond. 

20 The invention further provides a method of forming an ion of formula (I) comprising the steps of: 
(i) reacting a solid support of formula (IVbii) or (TVbiii): 



Ss 




^Ar* (L M {M} p ) q 

(Ar 2 ^ — C— [Ar 1 — (L M {M} p ) q ] m . 1 

X* 



(TVbii); 




i 



with a biopoiymer, B P , having at least one group capable of reacting with M to form a covalent 
linkage, to provide a modified solid support of the formula (Vbii) or (Vbiii), respectively: 




S S 



Arir— (L M {]Vr— Bp'} p ) q 



(Ar 2 ^— C— [Ar 1 — (Lm{M'— B P '} p ) q ] m .i 

X * (Vbii); 




S S 

"Ar 2 

(Ar 2 ) n .i C— [Ar 1 — (L M {M'-Bp'}p) q ] m 

x * (Vbiii); 

5 and either: 

(iia) for modified solid supports of formula (Vbii), either simultaneously or 
sequentially, dissociating X* from the- derivative of formula (Vbii) and cleaving the S s - - -Ar 1 . bond 
between the solid support and the Ar 1 group to form an ion of formula (T); or 

(iib) for modified solid supports of formula (Vbiii), either simultaneously or 
10 sequentially, dissociating X* from the derivative of formula (Vbiii) and cleaving the S s - - -Ar 2 bond 

between the solid support and the Ar 2 group to form an ion of formula (T); 

where: X*, Ar 1 , Ar 2 , B P \ Lm, M, M 1 , n, m, p, q, S s , C- - -S s , S s - - -Ar 1 and S s - - -Ar 2 are as defined 
above. 

The invention further provides a method of forming an ion of formula (T) comprising the steps of: 
15 (i) reacting a solid support of formula (IVaiv) or (IVbiv): 



{M^LmM^--^ S S 




Ar* (L M {M} p ) q .i 

(Ar 2 ),,— C— [Ar 1 — (L M {M} p ) q ] m .i 

X (TVaiv); 



.with a biopolymer, B P , having at least one group capable of reacting with M to form a covalent 
linkage, to provide a modified solid support of the formula (Vaiv) or (Vbiv), respectively: 

{Bp 1 — M , } p . 1 L M {M , -B P , > 



Ar 2 (Lj^M'-BpVq-i 

(Ar 2 ) n — C— [Ar 1 — (L M {lvr- B P , > p ) q ] m .i 

X 



(Vaiv); 



{B p '— M-lp-iLMiM^-Bp'} 

Ar* (L M {M , -Bp , > p ) q .i 

(Ar 2 ),,— C— [Ax 1 — (L M {M , -Bp'} p ) q ] m .i 

X* (Vbiv); 

5 and either: 

(iia) for modified solid supports of formula (Vaiv), cleaving the C-X bond 
between X and the a-carbon atom to form the ion of formula (T); or - ^ 

(iib) for modified solid supports of formula (Vbiv), dissociating X* from the 
derivative of formula (Vbiv) to form the ion of formula (I); 

10 where: 

X, X*, Ar 1 , Ar 2 , B P ', L M , M, M\ p, q, n, m, and S s are as defined above; 
M"- - -Ss comprises a bond between M" and Ss; and 

M" is the same as M except that S s is bound to a portion of M which does not form part of 

M\ 

15 In this embodiment of the invention, the solid support is bound to a part of group M" which does not 
go one to form the residue M\ Thus; the derivatised biopolymer will be released from the soiid 
support during the derivativisation step and an additional step of cleaving the biopolymer from the 
solid support is not required. 

The modified soKd supports" of formulae"(Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) or (Vbiv) may 
20 optionally be washed after step (i). 

The invention also provides solid supports of the formulae (IVai), (TVaii), (TVaiii), (TVaiy), (TVbii), 
(TVbiii) and (TVbiv), as defined above. Similarly, the invention provides modified solid supports of 
the formulae (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii), and (Vbiv), as defined above. 
The heterogeneous methods of the invention are particularly appropriate for synthetic Copolymers, 




m 



< Methods of Analysis 

The invention also provides a method for analysing a biopolymer, B P , comprising the steps of: 

(i) reacting the biopolymer Bp with a compound of formula (Ha) or (Eh) or a solid 
support of formula (IVai), (TVaii), (IVaiii), (IVaiv), (TVbii), CTVbiii) or (IVbiv); 
. 5 (ii) providing an ion of formula (I); and 

(iii) analysing the ion of formula (I) by mass spectrometry. 

The biopolymer will typically have been obtained using a preparative or analytical process. For 
example, it may have been purified using various separation methods (e.g. 1 -dimensional or 
2-dimensional, reverse-phase or normal-phase separation, by e.g. chromatography or electrophoresis) 
10 and the separation may be based on any of a number of characteristics (e.g. isoelectric point, 
molecular weight, charge, hydrophobicity, etc.). Typical methods include 2D SDS-PAGE , 2D liquid 
chromatography (e.g. Multidimensional Protein Identification Technology, MudPIT, or 2D HPLC 
methods). The separation method can preferably interface directly with the mass spectrometer. 

Known analytical techniques can thus be adapted or improved by the method of the invention. A 
15 particularly preferred method involves 2D-PAGE of a biopolymer, or mixture of biopolymers, 
selection of a spot of interest in the electrophoretogram, and then derivatisation and analysis of that 
spot using the techniques of the invention. The biopolymer may be proteolytically digested prior to 
its analysis (typically within the PAGE gel, but optionally digested after extraction from the gel) 
and/or may itself be the product of a proteolytic digest. 

20 The invention also provides, in a method for analysing a biopolymer, B P , the improvement consisting 
of: (i) reacting a biopolymer, Bp with a compound of formula (Ha) or (Tib) or a solid support of 
formula (IVai), (IVaii), (TVaiii), (IVaiv), (IVbii), (IVbiii) or (Wbiv); (ii) providing an ion of formula 
(I); and (iii) analysing the ion by mass spectrometry. 

Typically, the analysis by mass spectrometry is carried out in a spectrometer which is suitable for 
25 MALDI-TOF spectrometry. ... . - 

In the spectrometer, the ion source may be a matrix-assisted laser desorption ionisation (MALDI), an 
electrospray ionisation (ESI) ion source, a Fast-Atom Bombardment (FAB) ion source. Preferably, 
the ion source is a MALDI ion source. The MALDI ion source may be traditional MALDI source 
(under vacuum) or may be an atmospheric pressure MALDI (AP-MALDI) source. MALDI is a 
30 preferred ionisation method, although the use of a matrix is generally not required 

In the spectrometer, the mass analyser may be a time of flight (TOF), quadrupole time of flight 
(Q-TOF), ion trap (IT), quadrupole ion trap (Q-IT), triple quadrupole (QQQ) Ion Trap or Time-Of- 
Flight Time-Of-Flight (TOFTOF) or Fourier transform ion cyclotron resonance (FTICR) mass 
analyser. Preferably, the mass analyser is a TOF mass analyser. 



ft 



{ Further Embodiments 

M bound to B P ' by a non-covalent linker 

Tie above-mentioned embodiments of the invention may also be provided in which 
B,. by a non-covaient bond. All the other featiues of the invention are me same except me gronps 
5 which relate to me non-covalent bond between M' and B P '. 

* non-covalent bond may be direct between M and Bp' or ma, be provided by one or more 
binding groups present on M and/or B P '. 

Preferred non-oovalen, bonds are those having an association constant (K.) of at least 10» M" , 
preferably about 10 15 M" 1 . 

,0 m preferred embodiment, one of M 1 and V will have a binding group comprising biotin, and me 
other of M' and B P ' will have a binding group comprising avidin or streptav.din. 
MM*, when the compounds of the invention comprise a non-covalen, bond between M -W 
and a enable bond between C and Ss, Ax' and Ss, or At* and Ss, these bonds are dtffe^ly 
eieavable. More preferably, the non-cova.en, bond between M* end Br' ts not cieaved untie 

IS conditions which me cleavable bond between C and Ss, Ar' and S s , or A? and Ss. as appropnate^s 
cleaved. 

L M bound to Ar 1 by more than one bond 

Tie above-mentioned embodiments of me invention may a.so be provided in which L M is bound re 
Ar. by more man one covaUnt bond (e.g. 2 or 3 bonds) which are eiflter -■**»«•- 
2 „ H bonds, or one or more multiple bonds (a.g. donb.e or triple covalen, ««*^^ 
features of me invention are the same except the groups which relate ft tine bond or bonds between 

Ar 1 and Lm- 

Ionisatton of Compounds other them Biopolymers . 

b addition to biopolymers, fte present invention may be used for ionising any moleculeor complex 
25 of molecules which requires mass spectrum analyst, Thus, the above-mentioned embodnnents of the 

Lention may also be provided in which B, is rep.aced by any molecme or complex havmg a, e^ 
- Z group capable of reacting with M to form a cova.cn, linkage. AU the other feaatres of ft 

ftJuoTare the same, except group M is group capable of reacting with fte mo,ecu>e ft be 

analysed. 

30 Examples of other moleculea which may be analysed in fte present invention inchtde 

polyl (e.g. synftetic poryesrers, polities and polycarbonates), pe hochenu cala „d«* 
module* (e.g. alkanes, alkenes, amines, alcohols, e*era and amides). Amrnes are particurarly 



preferred. 





( Examples of complexes which may be analysed in the present invention include double- and triple- 
stranded RNA, DNA and/or peptide nucleic acid (PNA) complexes, enzyme/substrate complexes, 
multimeric proteins {e.g. dimers, trimers, tetramers, pentamers, etc.), virions, etc. 

Preferably, when the compound to be ionised is not a biopolymer, all embodiments of the invention 
5 (including products of formulae (T), (Ha), (lib), (ma), (mb), (TVai), (TVaii), (TVaiii), (TVaiv), (TVbii), 
(IVbiii), (TVbiv), (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) and (Vbiv), methods of forming an ion 
of formula (T) and methods of analysis) involving or relating to the compound of formula (XI) are 
disclaimed. 



OMe 



solid support 




10 Preferred Embodiments 
Definition ofC* 

Preferably, C* bears a single positive charge such that ions of the invention are cations and the ion 
of formula (I) has the following structure: 

(Ar 2 )— C— [Ar 1 — (L M {M-— B P '} p ) q ] m 

15 and the compounds of formulae (lib), (mb), (TVbii), (TVbiii), (TVbiv), (Vbii), (Vbiii) and (Vbiv) 
have the structures disclosed in table 1 . 

n, m, p and q 

For the purposes of compounds of the invention having n-1 groups Ar 2 , n may net be less than 1 . • - 
Preferably n = 2 and m = 1 . 



20 Preferably p = 1, 2 or 3. Preferably p= 1 . 
Preferably q = 1 , 2 or 3. Preferably q = 1 . 

Preferably n = 2, m = 1, p = 1 and q = 1 . The ion of formula (T) thus has the structure: 




I and the compounds of formulae (Ha), (lib), (Ma), (HTb), (IVai), (TVaii), (IVaiii), (IVaiv), (TVbii), 
(TVbiii), (IVbiv), (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) and (Vbiv) have the structures 
disclosed in table 2. 

Biopolymers 

5 The term 'biopolymer' includes polymers found in biological samples, including polypeptides, 
polysaccharides, and polynucleotides {e.g. DNA or RNA). Polypeptides may be simple copolymers 
of amino acids, or they may include post-translational modifications e.g. glycosylation, lipidation, 
phosphorylation, etc. Polynucleotides may be single-stranded (in whole or in part), double-stranded 
(in whole or in part), DNA/RNA hybrids, etc. RNA may be mRNA, rRNA or tRNA. 

10 Advantageous biopolymers are those which do not readily form a molecular ion in known 
MALDI-TOF MS techniques, especially those which do not form a molecular ion on illumination of 
laser light at 340 nm. 

Biopolymers for use in the invention comprise two or more monomers, which may be the same or 
different as each other. Preferred biopolymers comprise at least pp monomers, where pp is 5 or more 
15 (e.g. 6, 7, 8, 9, 10, 15, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 90, 100, 125, 150, 175, 200, 250). More 
preferred biopolymers comprise ppp or fewer monomers where ppp is 300 or less (e.g. 200, 1 00^50). 

Biopolymers may have a molecular mass of at least qq kDa, where qq = 0.5 or more (e.g. 0.6, 0.7, 
0.8, 0.9, 1, 1.5, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 25, 30, 40, 50, 75, 100, etc.). Preferred biopolymers 
are those having a molecular mass within the range of detection of a mass spectrometer. More 
20 preferred biopolymers have a molecular mass of qqq kDa or less, where qqq is 30 or less (e.g. 20, 10, 
5)- 

Preferably, the mass, m(IX), of the fragment (TX) 

(Ar 2 )— C— [Ar 1 -(L M {lVP} p ) q ] m 

* (IX) 

of the cation of formula (I) is significantly less than the mass, m(Bp'), of the biopolymer residue Bp 1 . 
25 For example the ratio m(B P ! ) / m(IX) is preferably more than nn, where nn is at least 2 (e.g. 3, 4, 5, 
10, 100, 1000, etc.). 

The invention is suitable for use with purified biopolymers or mixtures of biopolymers. For example, 
a pure recombinant protein could be derivatised and analysed by MS, or biopolymers within a 
cellular lysate or extract could be derivatives and then analysed. 

30 Preferred biopolymers are polypeptides. Particularly preferred biopolymers are polypeptides formed 
after proteolytic digestion of a protein. 



Biopolymers bound to solid supports 




( Bp is thus derivatised in situ while bound to the support, and is then released. As the biopolymer is 
bound to the solid support, this aspect of the invention is particular relevant to methods involving 
compounds of formulae (Ha) and (lib). 

The biopolymer may be bound to the solid support by a covalent, ionic, hydrogen, dipole-dipole or 
5 van derWaals bond (also known as a dispersion bond or a London forces bond). The covalent, ionic, 
hydrogen, dipole-dipole or van der Waals bond may be direct between the biopolymer and the solid 
support or may be provided by one or more binding groups present on the biopolymer and/or solid 
support. Preferred groups are non-covalent groups. 

Examples of groups which can form these types of bond, and methods for cleaving these types of 

1 0 bond, are set out below in connection with C- - -S s bonds, etc. 

In a particularly preferred embodiment, the solid support is provided with HNMe 3 ) + binding groups 
and the biopolymer has a net negative charge, or vice versa (i.e. the -(NMe 3 ) + is on the biopolymer). 
In other preferred embodiments, the solid support is provided with anions such as carboxylate, 
phosphate or sulphate, or anions formed from acid groups, and the biopolymer (e.g. a histone) has a 

1 5 net positive charge, or vice versa. 

" •" Reactivity with group M 

The biopolymers have at least one reactive group capable of reacting with M to form a covalent 
linkage. Such groups typically include naturally occurring groups and groups formed synthetically on 
the biopolymer. 

20 Naturally occurring groups include lipid groups of lipoproteins (e.g. myristoyl, 
glycosylphosphatidylinositol, ethanolamine phosphoglycerol, palmitate, stearate, S- or N- or O-acyl 
groups, lipoic acid, isoprenyl, geranylgeranyl, farnesyl, etc.), amide, carbohydrate groups of N- and 
O- glycoproteins, amine groups (e.g. on lysine residues or at the N-terminus of a protein), hydroxyl 
(e.g. in p-hydroxyaspartate, p-hydroxyasparagine, 5-hydroxylysine, y^hydroxyproline), thiol, 
25 sulfhydryl, phosphoryl, sulfate, methyl, acetyl, formyl- (e.g. on N-terminal methionines -from 
prokaryotes), phenyl, indolyl, guanidyl, hydroxyl, phosphate, methylthio, ADP-ribosyl etc. 
The reactive group is bound to the biopolymer by one or more covalent bonds (e.g. 2 or 3 bonds), 
which are either single, double or triple covalent bonds (preferably single bonds). Preferably, the 
reactive group is bound to the biopolymer by one single bond. 
30 Groups which may be formed naturally or synthetically on the biopolymer and which are bound to 
the biopolymer by one bond include: -NR 2 e.g. -NHR, especially -NH 2 ; -SR e.g. -SH; -OR e.g. -OH; 
-B(R)Y; -BY 2 ; -C^Y; -C(R)Y 2 ; -CY 3 ; -C(=Z)Y e.g. -C(=0)Y; -Z-C(=Z)Y; -C(=Z)R e.g. -C(=Z)H, 
especially -C(=0)H; -C(R)(OH)OR; -C^XOR).; -S(=0)Y; -^S(=0)Y; -S(=0)aY; -Z-S(-0) 2 Y; 
S(=0) 3 Y; -2>S(=0) 3 Y; -P(=Z)(ZR)Y e.g. -P(=0)(OH)Y; -P(=Z)Y 2 ; -Z-P(=Z)(ZR)Y; WW* 




! Groups which may be formed synthetically on the biopolymer and which are bound to the 
biopolymer by two bonds include -N(R)- e.g. -NH-; -S-; -0-; -B(Y>; -C(R)(Y)-; -CY 2 -; -C(=0)-; 
-C(OH)(OR)-; -C(OR) 2 -. 

Groups which may be formed synthetically on the biopolymer and which are bound to the 

I 

5 biopolymer by three bonds include CQO . 

Preferred groups include nucleophilic groups, either natural or synthetic, e.g.: -NR 2 e.g. -NHR, 
especially -NH 2 ; -SR e.g. -SH; -OR e.g. -OH; -N(R> e.g. -NH-; -S-; and -O-. The groups -NH 2 , -SH 
and -OH are particularly preferred. 

Another preferred reactive group is maleimidyl: 




10 

Y is independently a leaving group, including groups capable of leaving in an SN 2 'substitution- 
reaction or being eliminated in an addition-elimination reaction with the reactive group of the 
biopolymer B P . Preferred examples of Y include halogen (preferably iodo), Ci-ghydrocarbyloxy (e.g. 
C]_ 8 alkoxy), CVghydrocarbyloxy substituted with one or more A, Ci.gheterohydrocarbyloxy, 
15 Ci-gheterohydrocarbyloxy substituted with one or more A, mesyl, tosyl, pentafluorophenyl, 
-O-succinimidyl (formula VII), -S-succinimidyl, or phenyloxy substituted with one or more A e.g. p- 
nitrophenyloxy (formula VIII). 



o 



(vn) 




N0 2 



(vm) 



Z is independently O, S or N(R). Preferred (=Z) is (=0). 



20 R is independently H, Ci_ghydrocarbyl (e.g. C^alkyl) or Ci-ghydrocarbyl substituted with one or 
more A. 

R is preferably H. 

Other groups which may be formed naturally or synthetically on the biopolymer include groups 
capable of reacting in a cycloaddition reaction, especially a Diels-Alder reaction. 




mum 





and multivalent derivatives formally formed by removal of one or more hydrogen atoms, where A 1 is 
-R 1 or -Z^R 1 , where R 1 and Z 1 are defined below. 

Preferred dienophile groups are -CR 1 ^, -CR^CCRV, ^A 2 =CR> 2 , -CA^CCR^A 2 or 
5 -CA 2 =CA 2 2 , and multivalent derivatives formally formed by removal of one or more, hydrogen 
atoms where R 1 is defined below and A 2 is independently halogen, trihalomethyl, -NO* -CN, 
.^O" -C0 2 H, -C0 2 K\ -SO3H, -SOR 1 , -S0 2 R\ -SO3R 1 , -OCC^OR 1 , -C(=0)H, -C(=0)R\ 

-oc^or 1 , , -oc^conrS, -ncr^or 1 , -c(=s)m} 2 , -nr x c(=s)r\ -so 2 m} 2 , -nr so 2 r , 

-N(R 1 )C(=S)NR 1 2 , or -N(R 1 )S0 2 NR 1 2 , where R 1 is defined below. A particularly preferred dienophile 
10 group is maleimidyl. 

GroupM .. 

The group M is capable of reacting with the reactive group of the biopolymer, Bp, to form a covalent 
linkage. [Group 'M' is shown as 'AFG* in the drawings]. 

The group M is bound to L M by one or more covalent bonds {e.g. 2 or 3 bonds, especially 2 such 

15 as L V-J^), which are either single, double or triple covalent bonds (preferably single bonds). 
Preferably, M is bound to L M by one single bond. 

Examples of group M bound to L M by one bond include -NR 2 e.g. -NHR, especially -NH 2 ; -SR e.g. 
-SH; -OR e.g. -OH; -B(R)Y; -BY 2 ; -C(R) 2 Y; -C(R)Y 2 ; -CY 3 ; -C(=Z)Y e.g. -C(=0)Y; -Z-C(-Z)Y; 
-C(=Z)R e.g. -C(=Z)H, especially -C(=0)H; -C(R)(0H)OR; -C(R)(OR) 2 ; -S(=0)Y; ; -Z-S(-0)Y; 
20 -SCO^Y; -Z-SC^Y; -S(=0) 3 Y; -Z-S(=0) 3 Y; -P(=Z)(ZR)Y e.g. -P(=0)(OH)Y; -P(-Z)Y 2 ; 
-Z-P(=Z)(ZR)Y; -Z-P(=Z)Y 2 ; -P(=Z)(R)Y e.g. -P(=0)(H)Y; -Z-P(=Z)(R)Y; or -N=C(=Z) e.g. 
-N=C(=0). 

Examples of group M bound to L M by two bonds include -N(R)- e.g. -NH-; -S-; -O-; -B(Y)-; 
-C(R)(Y)-; -CY 2 -; -C(=0)-; -C(OH)(OR)-; -C(OR)r. 

25 Examples of group M bound to L M by three bonds include C 00 

Preferred groups M include electrophilic groups, especially those susceptible to SN 2 substitution 
reactions, addition-elimination reactions and addition reactions, e.g. -B(R)Y;' -BY,; "CCR^Y; 




i -P(=Z)(ZR)Y e.g. -P(=0)(OH)Y; -P(=Z)Y 2 ; -Z-P(=Z)(ZR)Y; -Z-P(=Z)Y 2 ; -P(=Z)(R)Y 

-P(=0)(R)Y; -Z-P(=Z)(H)Y; -N=C(=Z) -N=C(=0); -B(Y>; -C(R)(Y)-; -CY 2 -; -C(=0)-; 

I 

-C(OH)(OR)-; -C(OR) 2 -; or — C 00 . 

Another preferred group M is maleimidyl. 

5 Y, Z and R are defined as above. 

Particularly preferred groups M include -C(=0)Y, especially -C(=0)-0-succinimidyl and 
-C(=0)-0-(p-nitrophenyl). 

Other groups M include groups capable of reacting in a cycloaddition reaction, especially a Diels- 
Alder reaction. 

10 In the case of Diels-Alder reactions, the reactive group on the biopolymer is either a diene or a 
dienophile. Preferred diene groups are 




and multivalent derivatives formally formed by removal of one or more hydrogen atoms, where A 1 is 
-R 1 or -Z l R\ where R 1 and Z 1 are defined below. [ 

15 Preferred dienophile groups are -CR l =CR} 2 , -CR^CCR^A 2 , -CA^CR 1 ^ -CA^CCR^A 2 or 
-CA 2 =CA 2 2 , and multivalent derivatives formally formed by removal of one or more hydrogen 
atoms, where R 1 is defined below and A 2 is independently halogen, trihalomethyl, -N0 2 , -CN, 
-N f (R 1 ) 2 0 - , -C0 2 H, -C02R 1 , -SO3H, -SOR 1 , -S0 2 R l , -SO3R 1 , -OC(=0)OR 1 , -C(=0)H, -C(=0)R\ 
-OCC^R 1 , , -OCC-CONR 1 ^ -NCR^C^OjR 1 , -C(=S)NR 1 2 , -NR 1 C(=S)R 1 , -SC^NR 1 ^ -NR^OaR 1 , 

20 -NCR^CC^NR^, or -NCR^SOsNR^, where R 1 is defined below. A particularly preferred dienophile 
group is maleimidyl. 

Matching B P and M 

The reactive group on the biopolymer [shown as -*P in the drawings] and the group M [shown as 
' AFG r in the drawings] must be dependency selected in order to form the covalent linkage.. For 
25 example, where the biopolymer includes the groups -NH 2 , -OH or -SH, M will typically be -B(R)Y; 
-BY 2 ; -C(R)2Y; -C(R)Y 2 ; -CY 3 ; -C(=Z)Y e.g. -C(=0)Y; -Z-C(=Z)Y; -C(=Z)R e.g. -C(=Z)H, 
especially -C(=0)H; -C(R)(OH)OR; -C(R)(OR) 2 ; -S(=0)Y; -Z-S(=0)Y; -S(=0) 2 Y; -Z-S(==0)2Y; 




0 



( -P(=Z)(R)Y e.g. -P(=0)(H)Y; -Z-P(=Z)(R)Y; -N=C(=Z) e.g. -N=C(=0); -B(Y>; -C(R)(Y)-; -CY 2 -; 

■ " J _ 

_ C (=0)s -C(OH)(OR)-; -C(0R)2-; or C <?> - 

In a preferred embodiment, one of the reactive group on the biopolymer and group M is a maleimidyl 
and the other will be a -SH group. 
5 Alternatively, when the covalent linkage is to be formed by a Diels Alder reaction, one of the 
reactive group on the biopolymer and group M will typically be a diene and the other will be a 
dienophile. 



M 


Group on B P 


Obtained Linkage M*-Bp* 


-C(=0)-0-succinimidyl [i.e. carboxy-NHS] 


-NH 2 


-CO-NH- 


-C(=0)-0-(p-nitrophenyl) 


-NH 2 


-CO-NH- 


-C(=0)-pentafluorophenyl 


-NH 2 


-CO-NH- 


Biotin 


avidin / streptavidin 


biotin-(strept)avidin 




-SH 




& 

o 




5 O 


-N _ C=S (isothiocyanate) 


-NH 2 


-NH-CS-NH- 



10 The covalent residue M'-Bp' is the reaction product of M and B P . B P ' will generally be the same as B P 
except that instead of the reactive group, B P ' will have a residue of the reactive group covalently 
bound to the residue M>. Depending on the choice of the reactive group and the choice of M, M' and 
the residue of the reactive group will typically form linkages, in the orientation L M -M'-B P ', including 
. -C(R) 2 Z-, -ZC(R) 2 -, -C(=Z)Z-, -ZC(=Z>, -ZC(-Z)Z-, -C(OH)(R)Z-, -ZC(OH)(R)-,-C(RXOR)Z-, :~ 
15 -ZC(R)(OR>, -C(R)(OR)Z-, -ZC(R)(OR)-, -S(=0)Z-, -ZS(=0)-, -ZS(=0)Z-, -S^Z-, -ZS(-0) 2 -, 
-ZS(=0) 2 Z-, -S(=0) 3 Z-, -ZS(=0) 3 -, -ZS(=0)3Z-, -P(=Z)(ZR)Z-, -ZP(=Z)(ZR>, -ZP(=Z)(ZR)Z-, 
-P(=Z)(R)Z-, -ZP(=Z)(R)-, -ZP(=Z)(R)Z-, -NH-C(=Z)-Z-, where Z and R are as defined above. 

Group M" 

M" is the same as M except that the group S s is bound to a portion of M which does not form part of 
20 M. Thus, M" is a residue of M formable by the conjugation of M and S s . However, M" need not 
necessarily be formed by the conjugation of M and S s . 

M"- - -S s comprises a covalent, ionic, dipole-dipole, hydrogen, or van der Waals bond. The covalent, 




\ Examples of groups which can form these types of bond, and methods for cleaving these types of 
bond, are set out below in connection with C- - -S s bonds, etc. 

This embodiment of the invention is advantageous, since the derivativisation of the biopolymer will 
also release the derivatised biopolymer from the solid support. Thus, an additional step of cleaving 
5 the biopolymer from the solid support is not required. 

Preferred groups M" are groups M having a leaving group, wherein the group S s is bound to the 
leaving group, e.g. groups M mentioned above having a leaving group Y, wherein the group S s is 
bound to the leaving group Y. 

A particularly preferred group M" is: 



10 




Where the group L M is a linker atom or group, it has a sufficient number of linking covdlfent bonds to 
link L M to the group Ar 1 by a single covalent bond (or more, as appropriate) and to link L M to the p 
instances of M (or M 1 , as appropriate) groups (which may be attached to L M by one or more bonds). 

15 The group L M may be directly bound to the aromatic part of Ar 1 , bound to one or more of the 
substituents. A of Ar 1 , or both. Preferably, L M is bound directly to the aromatic part of Ar 1 . 

When L M is a linker atom, preferred linker atoms are O or S, particularly O. 

When L M is a linker group, preferred linker groups, in the orientation Ar ! -(LM{M}p) q or 
Ar 1 -(L M {M , } p ) q , as appropriate, are -E M -, -(D M ) t -, -(E M -D M ) r , -(D M -E M ) r , -E M -(D M -E M ) t - or 
20 -D M -(E M -D M )t-, where a sufficient number of linking covalent bonds, in addition to. the covalent 
bonds at the chain termini shown, are provided on groups E M and D M for linking the p instances of M 
(or M') groups. 

- D M is independently Ci-shydrocarbylene or Q-ghydrocarbylene substituted with one or mofe~A7 

E M , in the orientation Ar 1 -(L M {M} p ) q or Ar 1 -(L M {M , } p ) q , as appropriate, is independently -Z M -, 
25 -C(=Z M )-, -Z M C(=Z M >, -C(=Z M )Z M -, -Z M C(=Z M )Z M -, -S(=0)-, -Z M S(=0)-, -S(=0)Z M -, 
-Z M S(=0)Z M -, -S(=0) 2 -, -Z M S(=0) 2 -, -S(=0) 2 Z M -, -Z M S(=0)2Z M -, where Z M is independently O, S or 
N(R M ) and where R M is independently H, C^hydrocarbyl (e.g. d. 8 alkyl) or Ci^hydrocarbyl 
substituted with one or more A. Preferably E M is, in the orientation Ar 1 -(L M {M} p ) q or 
Ar 1 -(L M {M , } p ) q , as appropriate, -O-, -S-, -C(=0>, -C(=0)0-, -C(=S)-,. -C(=S)0-, -OC(=S>, 



i -SC(=0)0-, -OC(=0)S-; -N(R M )C(=0)0-, -OC(=0)N(R M )-, -N(R M )C(=0)N(R M K 
" -N(R M )C(=S)N(R M )-, -N(R M )S(=0)N(R M )- or -Nfr M )S(<>) 2 N(R M )-. 

t = 1 or more, e.g. from 1 to 50, lto 40, 1 to 30, 1 to 20 or 1 to 10. Preferably t = 1 , 2, 3, 4, 5, 6, 7, 8, 
9, or 10. 

5 Preferably, L M links one group M (or M') to Ar\ M (or MO is linked to L M by a single covalent bond 
and therefore no additional bonds are required (e.g. L M {Mh may be -E M -{M>, -(D M ) t -{M}, 
-(E^t-iM), -(D M -E M ) t -{M>, -E M -(D M -E M ) t -{M} or -D M -(E M -D M )t-{M}). 
Where L M includes a group which also falls within the definition of group M, the group M is 
preferably more reactive than the group included in L M . 



10 Lm is preferably -(D M ) t -> -(E M -D M ) t -, or -D M -(E M -D M ) t - 



When group L M is -(D M ) t -» t is preferably 1. D M is preferably C^alkylene, preferably methylene or 
ethylene. 

When group L M is -(E^DV, or -D M -(E M -D M ) t - 3 E M is preferably (in the orientation Ar^Mp),, 
or Ar'-<L M {M'} p ) q , as/appropriate), -C(=0)N(R M )- (e.g. -C(=0)NH-) or O, and D M is preferably 
15 C^alkylene, preferably ethylene or propylene. The group -(E M -D M ) t - is preferred, a particularly 
preferred example of which is (in the orientation Ar 1 -(L M {M} p ) q or ^-(LmIM'}^, as appropriate) 
-C(=0)NH-CH 2 CH 2 CH 2 -0-CH 2 CH 2 -0-CH 2 CH 2 -0-CH 2 CH 2 CH 2 -. 

It is also preferred that L M is a single covalent bond. 

When Ar 2 is phenyl, L M is preferably provided in a position ortho or para to C*. When Ar 2 is other 
20 than phenyl, L M is preferably attached to an atom which bears the charge in at least one of the 
resonance structures of the ions of formula (I). 

Where C* is a cation, L M is preferably an electron-donating group. Where is an anion, L M is 
preferably an electron-withdrawing group. 

C- - -S s , Ss- - -Ar 1 and Ss- - Ar 2 Bonds 
25 C- - -S S , S s - - -Ar 1 and S s - - -Ar 2 comprise a cleavable covalent, ionic, hydrogen, dipole-dipole or van 
der Waals bond (also known as a dispersion bond or a London forces bond). The covalent, ionic, 
hydrogen, dipole-dipole or van der Waals bond may be direct between C and S s , Ar 1 and S s , or Ar 2 
and S S , or may be provided by one or more binding groups present on C and/or S s , Ar 1 and/or S s , or 
Ar 2 and/or S s , respectively. 

30 Covalent Bonding 

Where the bond is covalent, 




( "When L 4 is a linker group, preferred linker groups are -E 4 -, -(D 4 ) t »-, -(E 4 -D 4 ) t »-, -(D 4 -E 4 ) t <-, 
-E*-(D*-EV- or -D 4 -(E 4 -D 4 ) t -. 

D 4 is independently Ci. 8 hydrocarbylene or Ci-ghydrocarbylene substituted with one or more A. 

E 4 is, in the orientation C-L 4 -S s , independently -Z 4 -, -C(=Z 4 )-, -Z 4 C(=Z 4 )-, -C(=Z 4 )Z 4 -, -Z 4 C(=Z 4 )Z 4 -, 
5 -S(=0)-, -Z 4 S(=0)-, -S(=0)Z 4 -, -Z 4 S(=0)Z 4 -, -S(=0) 2 -, -Z 4 S(=0>2-, -S(=0) 2 Z 4 -, -Z 4 S(=0) 2 Z 4 -, where 
Z 4 is independently O, S or N(R 4 ), and where R 4 is independently H, Ci^hydrocarbyl (e.g. Ci. 8 alkyl) 
or Ci-shydrocarbyl substituted with one or more A. Preferably E 4 is, in the orientation C-L 4 -S s , -O-, 
-S-, -C(=0)-, -C(=0)0-, -C(=S)-, -C(=S)0-, -OC(=S)-, -C(=0)S-, -SC(=0)-, -S(0)-, -S(0) 2 -, 
-N(R 4 )-, -C(=0)N(R 4 )-, -C(=S)N(R 4 )-, -N(R 4 )C(=0)-, -N(R 4 )C(=S)-, -S(=0)N(R 4 )-, -N(R 4 )S(=0)-, 
10 -S(=0) 2 N(R 4 )-, -N(R 4 )S(=0) 2 -, -OC(=0)0-, -SC(=0)0-, -OC(=0)S-, -N(R 4 )C(=0)0-, 
-OC(=0)N(R 4 )-, -N(R 4 )C(=0)N(R 4 )-, -N(R 4 )C(=S)N(R 4 )-, -N(R 4 )S(=0)N(R 4 )- or - 
N(R 4 )S(=0) 2 N(R 4 )-. 

t" = 1 or more, e.g. from 1 to 50, lto 40, 1 to 30, 1 to 20 or 1 to 10. Preferably t" = 1, 2, 3, 4, 5, 6, 7, 
8,9, or 10. 

15 Where L 4 includes a group which also falls within the definition of group M, the ; • group M is 
preferably more reactive than the group included in L . 

L 4 is preferably a linker atom, preferably O or S, particularly O. : 

When the solid support S s is gold, L 4 is preferably covalently attached to the S s by a sulphide or 
disulphide group. •;. 

20 Ionic Bonding :i - c - : 

Where the bond is ionic, the bond is typically direct (e.g. C* Ss*, where Ss* is a solid support 
counterionto C*). 

Alternatively, it may be provided by binding groups, e.g. chelating ligands, present on C or Ss, Ar 1 or 

Ss, or Ar 2 or Ss, respectively. In the case of C Ss bonds, the chelating ligand is typically only 

25 present on Ss and chelates with C*. 

Suitable chelating ligands which can bind anions include polyamines and cryptands, „ ... 

Suitable chelating ligands which can bind cations include polyacidic compounds (e.g. EDTA) and 
crown ethers. 

Hydrogen Bonding 



30 Where the bond is a hydrogen bond, the bond is usually provided by binding groups present on C or 
Ss, Ar 3 or S s , or Ar 2 or Ss, respectively. 




I of C or S S , Ar 1 or S s , or Ar 2 or S s , respectively, will have a binding group bearing an atom having 
one or more lone pair of electrons {e.g. an oxygen, sulphur or nitrogen atom). Preferably, one of C or 
S S Ar 1 or S s , or Ar 2 or S s , as appropriate, will have a binding group comprising biotin, and the other 
of C or S S , Ar 1 or S s , or Ar 2 or S s , respectively, will have a binding group comprising avidin or 

5 streptavidin. 

Alternatively, the hydrogen bond may be direct. 
Dipole-Dipole Bonding 

Where the bond is a dipole-dipole bond, it may be formed between permanent dipoles or between a 
permanent dipole and an induced dipole. 

10 Typically, in order to form the dipole-dipole bond, one of S s and the compound of the invention has 
a permanent dipole and the other of S s and the compound of the invention has an induced dipole or a 
permanent dipole, the attraction between Ihe dipoles forming a dipole-dipole bond. 
Preferably, S s comprises binding groups (e.g. acid groups, -(NMe 3 ) + , carboxy, carboxylate, 
phosphate or sulphate groups) which produce a dipole at the surface of the solid support to bmd the 

15 compound of the invention. 

Van der Waals Bonding 

Where the bond is a van der Waals bond, the bonding is usually provided by binding groups present 
on C or S Si Ar 1 or S s , or Ar 2 or S s , respectively. 

Typically, in order to form the van der Waals bond, at least one, but preferably both, of C or S s , Ar 1 
20 or S S or Ar 2 or S s , as appropriate, will have a hydrocarbyl or heterohydrocarbyl group (usually a 
large hydrocarbyl group having at least ten carbon atoms up to about 50 carbon atoms), optionally 
substituted with one or more A. Polyfluorinated hydrocarbyl and heterohydrocarbyl groups are 
particularly preferred. Typically, the hydrocarbyl or heterohydrocarbyl groups are aryl or heteroaryl 
groups or groups of the formula -C^Ar 3 , -C^Ar 3 ^ or -C{A^h, where Ar 3 is independently 
25 defined the same as Ar 2 and R 6 is H, C,. 8 hydrocarbyl, Cm hydrocarbyl substituted by one or more 
A 5 C M heterohydrocarbyl or Cm heterohydrocarbyl substituted by one or more A. 
A preferred binding group is tetrabenzofullerene (formula X). 

(formula X) 




Alternatively, the van der Waals bond may be direct 





■V. ■• J- h'-;M%^K>* * 



Bond Cleavage 



10 



15 



20 



25 



C-XBonds 

The C-X bonds are cleavable . by irradiation, electron bombardment, electrospray, fast atom 
bombardment (FAB), inductively coupled plasma (ICP) or chemical ionisation. Preferably, the C-X 
bonds are cleavable by irradiation or chemical ionisation. 

Hie term * irradiation' includes, for example, laser illumination, in particular as used in MALDI mass 
spectrometry. Laser light of about 340 nm is particularly preferred because it is typically used in 
MALDI mass spectrometers. 

The term 'electron bombardment' includes, for example, bombardment with electrons having energy 
of about 70 ev. 

Chemical ionisation can be effected, for example, by treatment with acid or acidic matrices (e.g. 
acidic matrices used in MALDI analysis). 

Preferably group X is halogen, hydroxy, Ci. 8 hydrocarbyloxy, Ci_ 8 hydrocarbyloxy substituted with 
one or more A, Ci-gheterohydrocarbyloxy, Ci.gheterohydrocarbyloxy substituted with one or more A, 
mesyl, tosyl, pentafluorophenyl, -O-succinimidyl -S-succinimidyl, or phenyloxy substituted with one 
or more A e.g. p-nitrophenyloxy. The groups pentafluorophenyl, -O-succinimidyl, -S-succinimidyl, 
and p-nitrophenyloxy are particularly preferred. , fc 

Group X may also be a -Q-oligonucleotide, where Q is O, S or N(R), where R is H, Ci^hydrocarbyl 
or Ci-ghydrocarbyl substituted with one or more A. Q is preferably O. , 

Group X may also be a nucleoside, preferably where the nucleoside is bound via its 5' end.. 

In some embodiments of the invention, where B P is an antibody (particularly where it is a 
monoclonal antibody that recognises a tumour-associated antigen), X is not: 




»-vK N " 

HO F 

or, optionally, X is not any other 2,6-diaminopurine nucleoside prodrug group. 

In some embodiments of the invention, X is not H. If X is H, preferably at least one of Ar 1 and Ar 2 is 
polycyclic, heterocyclic or unsubstituted. 



Ionic C *X* Bonds 




i" X* includes ions having single charges and multiple charges. Typically ions having multiple 
charges will be associated with an appropriate number of compounds of formula (TIb)/(IIIb), (TVbii), 
(IVbiii), (TVbiv), (Vbii), (Vbiii) or (Vbiv) in order balance the charge. Ions having multiple charges 
include doubly charged ions (e.g. SO* 2 ') and triply charged ions. X* preferably has a single charge. 

5 The counterion X* may be dissociated from the derivative of formula (lib), (IHb), (TVbii), (TVbiii), 
(TVbiv), (Vbii), (Vbiii) or (Vbiv) by irradiation, electron bombardment, electrospray, fast atom 
bombardment (FAB), inductively coupled plasma (LCP) or chemical ionisation. Preferably, the 
counterion X* may be dissociated by irradiation. 
When X* is a cation, X* is preferably H 4 ". 

10 When X* is an anion, X* is preferably, BF 6 " or C10 4 '. 
It is preferred that X* is an anion. 

C-'-Ss.Ss-'-Ar 1 orSs—Ar 2 

The C- - -S s , S s - - -Ar 1 or S s - - -Ar 2 bonds are cleavable by irradiation, electron bombardment, 
electrospray, fast atom bombardment (FAB), inductively coupled plasma (TCP) or chemical 
15 ionisation. Preferably, the.C- - : Ss, S s - - -Ar 1 or S s; - -Ar 2 bonds are cleavable by irradiation or chemical 
ionisation. 

Where appropriate, the C- - -S s , S s - - -Ar 1 or S s - - -Ar 2 bonds may be cleaved simultaneously or 
sequentially with the cleaving of the C-X bond or the dissociation of X*, as appropriate, by 
selection of suitable cleaving/dissociating conditions. 
20 In one embodiment of the invention, the C- - -S s bond in the solid support of formula (Vai) may be 
cleaved in sub-steps of step (iia) so that in a first sub-step a residue X (where X is the leaving group 
defined above) is provided and in a second subsequent sub-step the C-X bond is cleaved thereby 
forming the ion of formula (I). If desired, the second sub-step may be carried out substantially (e.g. 
seconds, minutes, hours or even days) after the first sub-step. 

25 Ar 1 andAr 2 

A? 

Ar 2 is independently an aromatic group or an s-sgssfe group substituted with one or~more A and is 
preferably independently cyclopropyl, cyclopropyl substituted with one or more A, aryl, aryl 
substituted with one or more A, heteroaryi, or heteroaryl substituted with one or more A. 
30 Where aryl or substituted aryl, Ar 2 is preferably C^o aryl or substituted Qwo aryl. Where heteroaryl 
or substituted heteroaryl, Ar 2 is preferably C^o heteroaryl or substituted C«o heteroaryl. 
Examples of aryl and heteroaryl are monocyclic aromatic groups (e.g. phenyl or pyridyl), fused 





i or fused polycyclic aromatic groups linked by a single bond, a double bond, or by a -(C=C) r - linking 
group, where r is one or more (e.g. 1, 2, 3, 4 or 5). 

Ar 2 is preferably C6-3 0 aryl substituted by one or more A, preferably phenyl or napthyl substituted by 
one or more A, more preferably phenyl substituted by one or more A. When Ar 2 is phenyl, A is 
5 preferably provided in a position ortho or para to C * . When Ar 2 is other than phenyl, A is preferably 
attached to an atom which bears the charge in at least one of the resonance structures of the ions of 
formula (I). 

When substituted, Ar 2 is preferably substituted by 1, 2 or 3 A. Ar 2 is preferably: 



OMe 




1 0 When unsubstituted, Ar 2 is preferably: 




Ar 1 : • 

Ar 1 is independently an aromatic group or an aromatic group substituted with one or more A. The 
15 definition of Ar 1 is the same as Ar 2 (as defined above), except that the valency of the group Ar 1 is 
' " adapted to accommodate the q instances of the linker L M . 

When q = 1, Ar 1 is a divalent radical and is preferably independently cyclopropylene, cyclopropylene 

substituted with one or more A, arylene, arylene substituted with one or more A, heteroarylene, or 

heteroarylene substituted with one or more A. 



20 Where arylene or substituted arylene, Ar 1 is preferably C W o arylene or substituted Ce-so arylene. 
Where heteroarylene or substituted heteroarylene, Ar 1 is preferably Q5.30 heteroarylene or substituted 




( Examples of arylene and heteroarylene are monocyclic aromatic groups (e.g. phenylene or 
pyridylene), fused polycyclic aromatic groups (e.g. napthylene) and unfiised polycyclic aromatic 
groups (e.g. monocyclic or fused polycyclic aromatic groups linked by a single bond, a double bond, 
or by a ~(C=C) r - linking group, where r is one or more (e.g. 1, 2, 3, 4 or 5). 

5 Ar 1 is preferably C 6 -3oarylene substituted by one or more A, preferably phenylene or napthylene 
substituted by one or more A, more preferably phenylene substituted by one or more A. When Ar 1 is 
phenylene, A is preferably provided in a position ortho or para to C*. When Ar 1 is other than 
phenylene, A is preferably attached to an atom which bears the charge in at least one of the resonance 
structures of the ions of formula (I). 

10 When substituted, Ar 1 is preferably substituted by 1, 2 or 3 A. 

When unsubstituted, preferred Ar 1 are: 




Combinations of Ar 

Optionally two or three of the groups Ar 1 and Ar 2 are linked together by one or more L 5 , where L 5 is 
15 independently a single bond or a linker atom or group; and/or two or three of the groups Ar 1 and Ar 2 
together form an aromatic group or an aromatic group substituted with one or more A. 

When L 5 is a linker group, preferred linker groups are -E 5 -, -(D 5 )t-, -(E 5 -D 5 ) t -, -(D 5 -E 5 ) t - 9 
-E 5 -(D 5 -E 5 ) t '- or -D 5 -(E 5 -D 5 ) t -. 

D 5 is independently Ci^hydrocarbylene or Ci^hydrocarbylene substituted with one or more A. 

20 E 5 is independently -Z 5 -, -C(=Z 5 )-, -Z 5 C(=Z 5 )-, -C(=Z 5 )Z 5 -, -Z 5 C(=Z 5 )Z 5 -, -S(=0)-, -Z 5 S(=0)-, 
. . ..-S(=0)Z 5 -, -Z 5 S(=0)Z^-S(=0) 2 -, -Z 5 S(=O) 2 -, r S(=0) 2 Z 5 -, -Z 5 S(=0) 2 Z 5 -, where Z 5 . is .independently. _. 
O, S or N(R 5 ) and where R 5 is independently H, C^ghydrocarbyl or C^ghydrocarbyl substituted with 
one or more A. Preferably E 5 is -O-, -S-, -C(=0)-, -C(=0)0-, -C(=S)-, -C(=S)0-, -OC(=S)-, 
-C(=0)S- 5 -SC(=0>, -S(O)-, -S(0)2-, -N(R 5 )-, -CC^NCR 5 )-, -C(=S)N(R 5 )-, -N(R 5 )C(=0)-, 
25 -N(R 5 )C(=S)-, -S(=0)N(R 5 )-, -N(R 5 )S(=0)-, -S(=0) 2 N(R 5 )-, -N(R 5 )S(=0) 2 -, -OC(=0)0-, 
-SC(=0)0-, -OC(=0)S-, -N(R 5 )C(=0)0-, -OC(=0)N(R 5 )-, -N(R 5 )C(=0)N(R 5 )-, -N(R 5 )C(=S)N(R 5 )-, 
-N(R 5 )S(=0)N(R 5 >or-N(R 5 )S(=<))2N(R 5 )-. ■ 

t = 1 or more, e.g. from 1 to 50, lto 40, 1 to 30, 1 to 20 or 1 to 10. Preferably t l = 1, 2, 3, 4, 5, 6, 7, 8, 
9, or 10. 



30 Where L 5 includes a group which also falls within the definition of group M, the group M is 




>' When two of the groups Ar 1 and Ax 2 are linked together by one or more (e.g. 2, 3 or 4) L 5 , they are 
preferably linked together by one L 5 , preferably O. 

When two or three of the groups Ar 1 and Ai 2 together form an aromatic group or an aromatic group 
substituted with one or more A, the aromatic group may be a carbocyclic aromatic group or a 
5 carbocyclic aromatic group in which one or more carbon atoms are each replaced by a hetero atom. 
Typically, in an aromatic group in which one or more carbon atoms are each replaced by a hetero 
atom, up to three carbons are so replaced, preferably up to two carbon atoms, more preferably one 
carbon atom. 

Preferred hetero atoms are O, Se, S or N, more preferably O, S or N. 

10 When two or three of the groups Ar 1 and Ar 2 together form an aromatic group or an aromatic group 
substituted with one or more A, preferred aromatic groups are Cg-50 aromatic groups. 
The aromatic groups may be monocyclic aromatic groups (e.g. radicals of suitable valency derived 
fiom benzene), fused polycyclic aromatic groups (e.g. radicals of suitable valency derived from 
napthalene) and unfused polycyclic aromatic groups (e.g. monocyclic or fused polycyclic aromatic 

15 groups linked by a single bond, a double bond, or by a -(C=C) r - linking group, where r is one or 
more <e.g. 1,2, 3, 4 or 5). ^ 
When two or three of the groups Ar 1 and Ar 2 together form a carbopolycyclic fused ring aromatic 
group, preferred groups are radicals of suitable valency obtained from napthalene, anthracene or 
phenanthracene, chrysene, aceanthrylene, acenaphthylene, acephenanthrylene, azulene, fluoranthehe, 

20 fluorene, as-indacene, j-indacene, indene, phenalene, and pleiadene. £ 
When two or three of the groups Ar 1 and Ar 2 together form a carbopolycyclic fused ring aromatfc 
group in which one or more carbon atoms are each replaced by a hetero atom, preferred groups are 
radicals of suitable polyvalency obtained from acridine, carbazole, p-carboline, chromene, cinnoline, 
indole, indolizine, isobenzofuran, isochromene, isoindole, isoquinoline, naphthyridine, perimidine, 

25 phenanthridine, phenanthroline, phenazine, phthalazine, pteridine, purine, pyrrolizine, quinazoline, 
quinoline, quinolizine and quinoxaline. 

Substitution qfAr 1 andAr 2 -Anions and Catiom 



When C* is a cation, A is preferably an electron-donating group, including -R 1 or -Z^ 1 , where R 1 
and Z 1 are defined below. Preferably, R 1 is C^hydrocarbyl, more preferably Ci. 8 alkyl, especially 
30 methyl. Z 1 is preferably O or NR. 1 . When C* is a cation, A is preferably -OMe or -N(Me)2. 

When C* is an anion, A is preferably an electron-withdrawing group, including halogen, 
trihalomethyl, -N0 2 , -CN, -N+CR 1 ):^-, -C0 2 H, -C0 2 R\ -S0 3 H, -SOR 1 , -S0 2 R\ -SO3R 1 , 
-OC(=0)OR', -C(=0)H, -C^OR 1 , -OC^COR 1 , -C(=0)NH 2 , -C(=0)m} 2 , -N^C^COOR 1 , 




( Solid Supports 

'Solid supports 1 for use with the invention include polymer beads, metals, resins, columns, surfaces 
(including porous surfaces) and plates (e.g. mass-spectrometry plates). 

The solid support is preferably one suitable for use in a mass spectrometer, such that the invention 
5 can be conveniently accommodated into existing MS apparatus. Ionisation plates from mass 
spectrometers are thus preferred solid supports, e.g. gold, glass-coated or plastic-coated plates. Solid 
gold supports are particularly preferred. 

Resins or columns, such as those used in affinity chromatography and the like, are particularly useful 
for receiving solutions of biopolymers (purified or mixtures). For example, a cellular lysate could be 
10 passed through such a column of formula (IVai), (TVaii), (IVaiii), (IVaiv), (IVbii), (IVbiii) or (TVbiv) 
followed by cleavage of the support to leave compounds of formula (I). 

. Solid supports of formulae (IVai), (TVaii), (IVaiii), {TVaiv), (IVbii), (IVbiii) or .(TVbiv) will generally 
present exposed groups M capable of reacting with a biopolymer, B P . For MS analysis, ions 
preferably have a predictable mass to charge (m/e) ratio. If a biopolymer reacts with more than one 
15 M group, however, then it will carry more than one positive charge once ionised, and its m/e ratio 
will decrease; Advantageously; therefore, the groups M are arranged such that any biopolymer 
molecule will covalently link with only a single group M. Consequently, each biopolymer will, on 
ionisation, carry a single positive charge and thus have a predictable mass to charge ratio. 

Typically, the surface density of the solid supports of (TVai), (TVaii), (IVaiii), (TVaiv), (TVbii), 
20 (IVbiii) or (TVbiv) will be provided so that a biopolymer molecule can only covalently link with one 
group M and thus to prevent the formation of multiply derivatised biopolymers. 

Varying the mass of compounds of the invention 

Within the general formulae (I), (Ila), (lib), (Ilia), (Elb), (IVai), (IVaii), (IVaiii), (TVaiv), (IVbii), 
(IVbiii), (IVbiv), (Vai), (Vaii), (Vaiii), (Vaiv), (Vbii), (Vbiii) and (Vbiv), there is much scope for 
25 • variation. There is thus .much scope of variation in the mass of these compounds.. In .some 
embodiments of the invention, it is preferred to use a series of two or more (e.g. 2, 3, 4, 5, 6 or more) 
compounds with different and defined molecular masses. 

The masses of the compounds of the invention can be varied via L M , Ar 1 and/or Ar 2 . Preferably, the 
masses of the compounds of the invention are varied by varying A on the groups Ar 1 and/or Ar 2 . 

30 In this aspect of invention, compounds of the invention advantageously comprise one or more of F or 
I as substituents A of the groups Ar f , Ar 2 or Ar 3 . F and I each only have one naturally occurring 
isotope, l9 F and 127 I respectively, and thus by varying the number of F and I atoms present in the 
structure of the compounds, can provide a series of molecular mass labels having substantially 
identical shaped peaks on a mass spectrum. 




(' In order to increase the molecular mass of the compounds of the invention and to increase the 
number of available sites for substitution by A, especially F and I, one or more of Ar 1 and Ar 2 may 
be substituted by one or more dendrimer radicals of appropriate valency, either as substituent A or 
group L M . 

5 Preferred dendrimer radicals are the radicals obtained from the dendrimers of US 6,455,071 and 
PAMAM dendrimers. 

The compounds of the invention may advantageously be used in the method of analysing a 
biopolymer disclosed herein, in particular in a method for following a reaction involving a 
biopolymer, B P , since the abundance of a species of may be determined by mass spectrometry by 
1 0 measuring the intensity of the relevant peak in an obtained mass spectrum. 

Specifically, mere is provided a method for analysing biopolymer B P , comprising the steps of: 

(i) reacting a first sample comprising biopolymer B P with a compound of formula (Ha) 
or (lib) or a solid support of formula (TVai), (TVaii), (TVaiii), (TVaiv), (TVbii), (TVbiii) or (TVbiv) at a 
time ti; 

15 (ii) reacting a second sample comprising biopolymer B P with a compound of formula 

(Ua) or (lib) or a solid support of formula (TVai), (TVaii), (TVaiii), (IVaiv), (TVbii), (TVbiii) or (IVbiv) 
at a later time t 2 ; % 

(iii) preparing and analysing cations of formula (I) from the first and second samples; and 

(iv) comparing the results of the analysis from step (iii). 

20 If levels of the biopolymer B P decrease between times t, and t 2 then there will be a decrease in 
detected ion; if levels of the biopolymer B P increase between times ti and t 2 then there will be an 
increase in detected ion. The effects of stimuli on transcription and/or translation can therefore be 
monitored. 

. . Advantageously, different compounds of formula (Ha) or (Tib) or different sqlid,supports.- offormula . 

25 (TVai), (TVaii), (IVaiii), (TVaiv), (TVbii), (TVbiii) or (TVbiv) are used at different times in order to 
facilitate simultaneous and parallel analysis of the first and second samples. For example, if the two 

compounds used at times ti and t 2 differ only by a *H to 19 F substitution then the relative .abundance 

of B P at the two times can be determined by comparing peaks separated by 1 8 units. 
Advantageously, the reaction of the biopolymer with the compound of formula (Tla) or (Tib) or the 

30 solid support of formula (TVai), (TVaii), (TVaiii), (TVaiv), (TVbii), (TVbiii) or (TVbiv) will fix -the- • - 
biopolymer to prevent it reacting further and the steps of providing and analysing the cations may be 
carried out at a later convenient time. Alternatively, if the reaction of the biopolymer with the 
compound offormula (Ea) or (lib) or the solid support of formula (TVai), (TVaii), (TVaiii), (TVaiv), 



( after reaction of the biopolymer with the compound of formula (Ha), or (Eh) or the solid support of 
formula (TVai), (IVaii), (IVaiii), (IVaiv), (Wbii), (IVbiii) or (TVbiv). 

Compounds of Formulae (Ha) and (lib) 

The compounds of formulae (Ha) or (lib) are available commercially or may be synthesised by 
5 known techniques. 

Commercially available compounds of formulae (Ea) or (lib) are disclosed, for example in the 
Molecular Probes Catalogue, 2002. Commercially available trityls, and derivatives and analogues 
thereof, may also be derivatised with the groups (L M {M} p ) q by known techniques. 

Methods for synthesis of compounds of formula (Ha) or (lib) useful in the present invention are 
10 described in Chem. Soc. Rev. (2003) 32, p. 3-13, scheme 2 and "1. introduction", last two 
paragraphs. Groups (L M {M} p ) q are usually introduced into the intermediates and the compounds are 
then assembled using the appropriate pathways. Alternatively, the groups (L M -{M} p ) q may be added 
after assembly of the aromatic groups and a-carbon of the compounds. 

Methods for synthesis of compounds of formulae (Ea) or (lib) are also described in WO99/60007. 
1 5 Preferred compounds of formula (Ha), (lib) and (TVai)are: 
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V Chemical Groups 

The ions of the invention are stabilised by the resonance effect of the aromatic groups Ar 1 and Ar 2 . 
The term 'C* is a carbon atom bearing a single positive charge or a single negative charge' 
therefore not only includes structures having the charge localised on the carbon atom but also 
5 resonance structures in which the charge is delocalised from the carbon atom. 

The term 'linker atom or group' includes any divalent atom or divalent group. 

The term 'aromatic group' includes pseudo-aromatic groups, e.g. cyclopropyl and cyclopropylene 
groups. 

The term 'halogen' includes fluorine, chlorine, bromine and iodine. 

10 The term 'hydrocarbyl' includes linear, branched or cyclic monovalent groups consisting of carbon 
and hydrogen. Hydrocarbyl groups thus include alkyl, alkenyl and alkynyl groups, cycloalkyl 
(including polycycloalkyl), cycloalkenyl and aryl groups and combinations" thereof, e.g. 
alkylcycloalkyl, alkylpolycycloalkyl, alkylaryl, alkenylaryl, cycloalkylaryl, cycloalkenylaryl, 
cycloalkylalkyl, polycycloalkylalkyl, arylalkyl, arylalkenyl, arylcycloalkyl and arylcycloalkenyl 

15 groups. Preferred hydrocarbyl are C x . u hydrocarbyl, more preferably Ci.8 hydrocarbyl. 

Unless indicated explicitly otherwise, where combinations of groups are referred to herein as. one 
moiety, e.g. arylalkyl, the last mentioned group contains the atom by which the moiety is attached .to 
the rest of the molecule. 

The term 'hydrocarbylene' includes linear, branched or cyclic divalent groups consisting of carbon 
20 and hydrogen formally made by the removal of two hydrogen atoms from the same or different 
(preferably different) skeletal atoms of the group. Hydrocarbylene groups thus include alkylene, 
alkenylene and alkynylene groups, cycloalkylene (including polycycloalkylene), cycloalkenylene and 
arylene groups and combinations thereof, e.g. alkylenecycloalkylene, alkylenepolycycloalkylene, 
alkylenearylene, alkenylenearylene, cycloalkylenealkylene, polycycloalkylenealkylene, 
25 arylenealkylene and aryleriealkehylene groups. Preferred hydrocarbylene are C M4 hydrocarbylene, 
more preferably Ci- 8 hydrocarbylene. 

The term 'hydrocarbyloxy' means hydrocarbyl-O-. 



The terms 'alkyl', 'alkylene', 'alkenyl', 'alkenylene', 'alkynl', or 'alkynlene' are used herein to refer 
to both straight, cyclic and branched chain forms. Cyclic groups include C M groups, preferably Cs-g 
30 groups. ... 

The term 'alkyl' includes monovalent saturated hydrocarbyl groups. Preferred alkyl are C u& , more 
preferably C M alkyl such as methyl, ethyl, n-propyl, i-propyl or t-butyl groups. 

Preferred cycloalkyl are C5-8 cycloalkyl. 




The term 'alkenyl' includes monovalent hydrocarbyl groups having, at least one carbon-carbon 
double bond and preferably no carbon-carbon triple bonds. Preferred alkenyl are C 2 a alkenyl. 
The term 'alkynl' includes monovalent hydrocarbyl groups having at least one carbon-carbon triple 
bond and preferably no carbon-carbon double bonds. Preferred alkynl are C 2 a alkynl. 

5 The term ' aryl' includes monovalent aromatic groups, such as phenyl or naphthyl. In general, the aryl 
groups may be monocyclic or polycyclic fused ring aromatic groups. Preferred aryl are C 6 -Ci 4 aryl. 

The term 'alkylene' includes divalent saturated hydrocarbylene groups. Preferred alkylene are Cm 
alkylene such as methylene, ethylene, n-propylene, i-propylene or t-butylene groups. 

Preferred cycloalkylene are C5-8 cycloalkylene. 
10 The term 'alkenylene' includes divalent hydrocarbylene groups having at least one carbon-carbon 

double bond and preferably no carbon-carbon triple bonds.. Preferred alkenylene are C2.4 alkenylene. 

The term 'alkynlene' includes divalent hydrocarbylene groups having at least one carbon-carbon 

triple bond and preferably no carbon-carbon double bonds. Preferred alkynlene are C 2 -4 alkynlene. 

The term 'arylene' includes divalent aromatic groups, such phenylene or naphthylene. In general, the 
15 arylene groups may be monocyclic or polycyclic fused ring aromatic group*. Preferred arylene. 

C6-C 14 arylene. 

The term 'heterohydrocarbyl' includes hydrocarbyl groups in which up to three carbon atoms, 
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced 
independently by O, S, Se or N, preferably O, S or N. Heterohydrocarbyl groups thus include 

20 heteroalkyl, heteroalkenyl and heteroalkynyl groups, cycloheteroalkyl (including 
polycycloheteroalkyl), cycloheteroalkenyl and heteroaryl groups and combinations thereof, e:g. 
heteroalkylcycloalkyl, alkylcycloheteroalkyl, heteroalkylpolycycloalkyl, alkylpolycycloheteroalkyl, 
heteroalkylaryl, alkylheteroaryl, heteroalkenylaryl, alkenylheteroaryl, cycloheteroalkylaryl, 
cycloalkylheteroaryl, heterocycloalkenylaryl, cycloalkenylheteroaryl, cycloalkylheteroalkyl, 

25 cycloheteroalkylalkyl, polycycloalkylheteroalkyl, polycycloheteroalkylalkyl, arylheteroalkyl, 
heteroarylalkyl, arylheteroalkenyl, heteroarylalkenyl, arylcycloheteroalkyl, heteroarylcycloalkyl, 
arylheterocycloajkenyl and heteroarylcycloalkenyl groups. 

The term 'heterohydrocarbylene' includes hydrocarbylene groups in which up to three carbon atoms, 
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced 
30 independently by O, S, Se or N, preferably O, S or N. Heterohydrocarbylene groups thus include 
heteroalkylene, heteroalkenylene and heteroalkynylene groups, cycloheteroalkylene (including 
polycycloheteroalkylene), cycloheteroalkenylene and heteroarylene groups and combinations thereof, 
e.g. heteroalkylenecycloalkylene, alkylenecycloheteroalkylene, heteroalkylenepolycycloalkylene, 




icycloheteroalkylenealkylene, polycycloalkyleneheteroalkylene, polycycloheteroalkyleneaikylene, 
aryleneheteroalkylene, heteroarylenealkylene, aryleneheteroalkenylene, heteroaiylenealkenylene 
groups. 

Where reference is made to a carbon atom of a hydrocarbyl or other group being replaced by an O, S, 
Se or N atom, what is intended is that: 

-CH- . , — N— 

is replaced by 



-CH= is replaced by -N=; or 

— CH2- is replaced by -O-, -S- or -Se-. 

The term 'heteroalkyl' includes alkyl groups in which up to three carbon atoms, preferably up to two 
10 carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or N, 
preferably O, S orN. 

The term 'heteroalkenyl' includes alkenyl groups in which up to three carbon atoms, preferably up to 
two carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or 

N, preferably O, S or N. ' • • 

1 5 The term 'heteroalkynyl' includes alkynyl groups in which up to three carbon atoms, preferably up to 
two carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or 
N, preferably O, S or N. 

The term 'heteroaryl' includes aryl groups in which up to three carbon atoms, preferably up to two 
carbon atoms, more preferably one carbon atom, are each replaced independently by O, S, Se or N, 
20 preferably O, S or N. Preferred heteroaryl are Cs-uheteroaryl. Examples of heteroaryl are pyridyl, 
pyrrolyl, thienyl or furyl. 

The term 'heteroalkylene' includes alkylene groups in which up to three carbon atoms, preferably up 
to two carbon atoms, more preferably one carbon atom,- are each replaced independently by O, S, Se 
or N, preferably O, S or N. 

25 The term 'heteroalkenylene' includes alkenylene groups in which up to three carbon atoms, 
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced 
. independently by O, S, Se or N, preferably O, S or N. 
The term 'heteroalkynylene 4 include alkynylene groups in which up to three carbon atoms, 
preferably up to two carbon atoms, more preferably one carbon atom, are each replaced 
30 independently by O, S, Se or N, preferably O, S or N. 

The term 'heteroarylene' includes arylene groups in which up to three carbon atoms, preferably up to 




N, preferably O, S or N. Preferred heteroarylene are C s .i4ieteroarylene. Examples of heteroarylene 
are pyridylene, pyrrolylene, thienylene or furylene. 

Substitution 

A is independently a substituent, preferably a substituent Sub 1 . 

5 S^ 1 is independently halogen, trihalomethyl, -N0 2 , -CN, -N+CR^Cr, -C0 2 H, -COjR 1 , -S0 3 H, -SOR 1 , 
-SOaR 1 , -SO3R 1 , -OC^COOR 1 , -C(=0)H, -C^R 1 , -00(0)^ , -NR 1 * -C(=0)NH 2 , -C(=0)NR 1 2 , 
-NCR^CC^OR 1 , -N(R I )C(=0)NR , 2, -OC(=0)NR 1 2 , -NCR^C^OR 1 , -C(=S)NR 1 2 , -MR. 1 C(=S)R 1 , 
-SOzNR^, -NR'SQaR 1 , -N(R 1 )C(=S)NR 1 2 , -NOR^SC^NR 1 * -R 1 or -Z 1 ^. 

ZMs O, S, SeorNR 1 . 

10 R 1 is independently H, d. 8 hydrocarbyl, Ci. 8 hydrocarbyl substituted with one or more S U b 2 , 
Ci-sheterohydrocarbyl or Ci-sheterohydrocarbyl substituted with one or more S U b 2 - 

S u b 2 is independently halogen, trihalomethyl, -N0 2 , -CN, -N^Ci^alkylkO", -C0 2 H, -C0 2 Ci-6alkyl, 
-S0 3 H, -SOC^alkyl, -S0 2 C^alkyl, -S0 3 C^alkyl, -OC(=0)OC^alkyl, -C(=0)H, -C(=0)C^alkyl, 
-OC(=0)C^alkyl, -N(C M alkyl) 2 , -C(=0)NH 2 , -C(=0)N(C,^alkyl) 2 , 

15 -NCd-galky^CC^OCCsalkyl), -N(Ci^alkyl)C(=0)N(C^alkyl) 2 , -OC(=0)N(C,^alkyl) 2 , 
-N(Ci^alkyl)C(=0)C^alkyl, -C(=S)N(Ci^alkyl)2, -N(C 1 ^alkyl)C(=S)C 1 ^alkyl, -S0 2 N(Ci^alkyl) 2 , 
-N(C,_ 6 alkyl)S0 2 C 1 . 6 alkyl, -N(C 1 ^alkyl)C(=S)N(C 1 ^alkyl) 2 , -N(C^alkyl)S0 2 N(C,^alkyl)2, C^alkyl 
or-Z'Ci^alkyl. 

Where reference is made to a substituted group, the substituents are preferably from 1 to 5 in number, 
20 most preferably 1. 

However, molecular mass labels of the invention will generally comprise 1 or more, typically 
between 1 and 100 (e.g. 1 to 50, preferably 1 to 20) substituents S^ 1 or S ub 2 , typically F or I, in order 
to vary the masses of the molecular mass labels. 

Miscellaneous 

25 A may optionally be a monovalent dendrimer radical or a monovalent dendrimer radical substituted 
with one or more substituents Sub 1 . 

General 

The term "comprising 55 means "including 55 as well as "consisting 55 e.g. a composition "comprising 55 X 
may consist exclusively of X or may include something additional e.g. X + Y. 

30 The term "about 55 in relation to a numerical value x means, for example, x±10%. 

The word "substantially 55 does not exclude "completely 55 e.g. a composition which is "substantially 
free 55 from Y may be completely free from Y. Where necessary, the word "substantially 55 may be 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 demonstrates conceptually the effect "of the signal on a mass spectrum by a compound of 
formula (Ha) or (lib) of the invention. Free biopolymer, such as a peptide, has poorer desorption 
5 properties characterised by a smaller peak on the left of mass-spectrum whereas desorption improves 
when the same molecule is conjugated to a compound of the invention. 




{ Figure 2 shows the steps of biopolymer with a compound of formula (IVai). The derivativisation of a 
biopolymer with a compound of the invention can be carried out more conveniently by utilising the 
solid phase-based format, whereby the compound is temporarily covalently attached to a solid 
support. This eliminates all the separation steps associated with homogenous approach as the only 

5 additional step required would be a washing step. The solid support can be a resin, a surface or a 
porous surface. Alternatively, the solid support may be a mass-spectrometry sample plate, which 
dramatically decreases the sample preparation time. Both gold, glass- and plastic-coated plates are 
compatible with this approach. 

Figure 3 shows the steps of 'reverse 1 biopolymer derivativisation on a covalent solid support whereby 
10 the release of the biopolymer derivative happens simultaneously with the derivativisation process. 
The process is applicable M groups involving leaving groups. 

Figure 4 shows the steps of biopolymer derivativisation on an ionic solid support. 

Figure 5 shows of the steps of solid support-assisted biopolymer derivativisation. The biopolymer is 
first trapped onto a solid support and then labelled with a compound of formula (Ila) or (lib). An 
15 advantage of this technique is that a preliminary sample enrichment occurs, since not all of the 
. . . biopolymer. in the sample will stick to the solid support surface. ........ 

Figure 6 shows the mass spectrum obtained when analysing an Gly-Gly-O-acyl dipeptide conjugated 
with a trityl compound of the invention. 

Figure 7 shows the mass spectrum obtained when analysing a conjugate of a peptide with a trityl 
20 compound of the invention. 

Figure 8 compared the mass spectra of a BSA digest without (8 A) and with (8B) labelling. 

Figure 9 shows the mass spectrum obtained when analysing a mixture of trityl-labelled amines. 

MODES FOR CARRYING OUT THE INVENTION 
Materials and Methods 

25 The solid supports were Tenta Gel Macrobeads OH and NH 2 , 280-320 microns, Rapp Polymer. 
(MA)LDI-TOF mass-spectra were recorded on a PE-ABI Voyager™ Elite Reflectron Delayed 
Extraction Instrument. TLC were carried out with Merck silica gel (Kieselgel 60 F254 precoated 
plates and Kieselgel 60 0.040-0.063 mm). HPLC was carried out on a Waters system (Milford, MA, 
USA). Phosphoroamidite couplings were carried out in an ABI 394 DNA/RNA synthesiser. 

30 Chemicals and solvents were from Sigma/Aldrich/Fluka (USA), and BDH/Merck. 

Example 1 — Conjugation of a trityl tag (in solution phase) with solid support-bound biopolymer 
A 15mer poly-T oligonucleotide was synthesised on an ABI 394 DNA synthesiser using a T CPG 
support according to standard protocols of phosphoramidite chemistry on 0.2 |imol scale. After the 
-last coupling,- a/MMTr-protected.v ammqUnkV^ USA) was -added $<k£j - e . 



i removed from the synthesiser, and after 10 min wash with acetonitrile it was attached to two 5 ml 
syringes and washed with a 0.1M solution of NHS-activated 4,4'-dimethoxy-4"-carboxyethyl trityl 
for 10 min at RT. The column was then washed with (3 x 10 ml) acetonitrile, placed on a DNA 
synthesiser and deprotected with ammonia according to standard protocols. The residue obtained 

5 after the evaporation was dissolved in 0.1 ml of 2M LiC10 4 and precipitated from cold acetone (1.5 
ml). The precipitate was washed with 0.5ml of acetone and dried. 

Example 2 — Homogenous conjugation of a trityl with non-polymeric ligands 

A solution of NHS-activated 4 ? 4 r -dimethoxy-4 H -carboxyethyl trityl (0.1M) in THF/dioxane (1 : 1) was 

mixed with a solution (0.5-1M) of an amine or of a mixture of amines (for example, propyl amine, 

10 butyl amine, pentyl amine, hexyl amine and phenethyl amine), typically 10 ml of a solution of an 
activated trityl with 5 ml of an amine solution. The mixtures were purified on prep-TLC (2mm-thick 
glass plates with UV254 indicator, Analtech/Aldrich-Sigma), typically in chloroform with 0.5% 
triethylamine. The areas containing the desired products were scratched off the plate, and the 
conjugates or the mixtures thereof were eluted using same solvent with 2-5% MeOH, filtered through 

15 a layer of glass wool, evaporated and dried. 

Example 3 — Homogenous conjugation of a nhs-activated trityl with polymeric ligands vm 
A peptide, an oligonucleotide, or any other biopolymer containing a (primary) amino group, is 
dissolved in a mixture of water and acetonitrile depending on its solubility, typically 20-50% of 
water in CH 3 CN. Non-aminogroup-containing buffers (ie. 50 mM sodium phosphate, 0.15 M NaCl, 
20 pH 7.2, or a bicarbonate buffer, but an additional desalting step may then need to be introduced to" cut 
off the metal ions prior to mass-spectrometry) can be used to keep the pH at between 7-9. "For 
particularly poorly soluble ligands other solvents may be used such as THF, DMSO, etc. £ 

A solution of NHS-activated 4,4 f -dimethoxy-4 ,! -carboxyethyI trityl in acetonitrile or THF is added in 
approx. 5-10 times excess compared to an amine component. Conjugation usually reaches the 
25 maximum yield over 2-4 hours of reaction time. The conjugate formed can be analysed by MS 
directly, or after HPLC-purification. - 

Example 4 — Conjugation of a solid phase-immobilised nhs-activated trityl tag with a ligand 

A Solid Phase-Immobilised NHS-Activated Trityl Tag was prepared by either method 1 or method 2. 

Method 1 : A NHS-Activated 4,4 , -dimethoxy-4"-carboxyethyl trityl tag was covalently attached to 
30 hydroxyl groups of 200 jam Rapp Polymer beads by shaking the suspension of 100 mg of the resin in 
5 ml of 0. 1 M solution of trityl chloride tag in dry. pyridine at +4°C for 3 hours and then washing the . 
resin with pyridine and acetonitrile and drying in vacuo. 

Method 2 . A 5'-tritylated thymidine phosphoramidite was prepared from NHS-activated 4,4'- 
dimethoxy-4"-carboxyethyl trityl chloride in a standard way [MJ. Gait, Oligonucleotide Synthesis: A 



1 phosphoramidite on an ABI DNA synthesiser using manuai supply of reagents (O.IM solution of a 
phosphoramidite and other standard phosphoramidite synthesis reagents) with a coupling step of 15 
min. The second column was first derivatised with a trebler phosphoramidite (Glen Res.) according 
to the manufacturers protocols and then coupled with' the trityl tag-containing phosphoramidite as 

5 described for the first column. Both columns were excessively washed with acetonitrile. 

The trityl loading of the solid supports produced by either method was determined 
spectrophotometrically (absorbance measurements at 490nm) to be 0.21 mmol/g for a. straight 
attachment and 0.39 mmol/g for a tritylation on top of the trebling synthon. (The hydroxyl group 
loading of the Rapp polymer used was 0.25mmol/g). 

10 To the solid support prepared as described above, a mixture of compounds to be labelled (typically 
peptides) is added, typically in a. mixture of 20-50% water in acetonitrile. After incubation, with 
occasional shaking, for 60-120 min the resin is washed with several volumes of the same solvent, and 
the conjugated products are cleaved off the resin, typically by adding 0.5-2% TFA in appropriate 
solvent. The collected sample is then analysed by MS. 

1 5 Example 5 — Mass spectrometry analysis of a derivatised Gly-Gly dipeptide 

Figure 6 shows the mass spectrum obtained from a compound of the invention comprising a 
derivatised Gly-Gly-O-acyl dipeptide biopolymer. 

The ion of formula (I) containing the derivatised Gly-Gly-O-acyl biopolymer is observed at the peak 
at molecular weight 5 16.5. There was no peak corresponding to the free dipeptide. 

20 The fragment of formula (VI), in which the derivatised Gly-Gly-O-acyl biopolymer has been lost, is 
observed at the peak at the molecular weight 374.6. 

Example 6 — Mass spectrometry analysis of a derivatised peptide 

Figure 7 shows the mass spectrum obtained from a compound of the invention comprising a 
derivatised peptide biopolymer. The free peptide had a molecular weight of 310. 

25 The ion of formula (T) containing the derivatised peptide biopolymer is observed at the peak at 
molecular weight 665.0. 

The fragment of formula (VI), in which the derivatised peptide has been lost, is observed at the peak 
at the molecular weight 375.0. 

Significantly, there is only a very small peak at molecular weight 310, where a peak corresponding to 

30 the free biopolymer would be found. The relative size of the peaks at 665.0 and 3 10 thus demonstrate 

the significantly improved ionisability of the compounds of the invention compared with free 
biopolymer. 

Example 7 — Spectral improvement by trityls 
j^^^SL^Ehree prote^i^^i^Blcasein \^A?^^^^&x^d\SdsbsA -wtKHt^s^radit&e ies6\^s^m^^d^^^-x^- 




i identified for each protein is shown below. The theoretical total number of peptides that would be 
produced by trypsin digestion of each protein was calculated in silico and is shown in the second 
column the table below. 



Protein 


Number of 
theoretical peptides* 


Total number of peptides identified 


MASCOT search score* 


Underivatised 


Derivatised 


Underivatised 


Derivatised 


BSA 


144 


14(10%) 


41 (28%) 


132 


126 


P-casein 


27 


4 (15%) 


13 (48%) 


no match 


123 


ADH 


60 


7 (12%) 


18 (30%) 


77 


111 



+ The number of theoretical peptides for each protein was generated assuming one 
5 missed cleavage and disregarding di- and mono-amino acids generated. 



* Score is -10*Log(P), where P is the probability that the observed match is a 
random event Protein scores greater than 63 are significant (p<0.05). 

Derivatisation of peptides with trityl groups of the invention thus improves detection, as a 
significantly larger number of peptides was detected for each of the three proteins when 
10 derivatisation was used. Furthermore, protein identification by mass fingerprinting can be improved. 

Taking p-casein as an example, the number of detectable fragments more than tripled, and the 
derivatised spectrum allowed a MASCOT-based identification which was not previously possible. 

Example 8 — BSA fragmentation and mass spectrometry j 
Bovine serum albumin (BSA) was digested with trypsin and analysed by MALDI-TOF. The resulting 
15 spectrum is shown in Figure 8A. The experiment was repeated, but the peptide mixture was labelle<| 
with a dimethoxytrityl label after trypsin digestion. The spectrum in Figure 8B shows the dramatic 
increase in visible ions due to the trityl label. Four specific peptides have been highlighted in both" 
spectra. ■ 

Example 9 — Mass spectrometry of amines 
20 A solution of NHS-activated 4,4 , -dimethoxy-4 ,, -carboxyethyl trityl (0.1M) in THF/dioxane (1:1) was 
mixed with a solution (0.5-1M) of an amine or of a mixture of amines (for example, propyl amine, 
butyl amine!, pentyl amine, hexyl amine and phenethyl amine), typically 10 ml of a solution of an 
activated trityl with 5 ml of an amine solution. The mixtures were purified on prep-TLC (2mm~thick 
glass plates with UV254 indicator, Analtech/Aldrich-Sigma), typically in chloroform with 0.5% 
" 25~ triethylamine. The areas containing the desired "products were scratched off the plate, and the 
conjugates or the mixtures thereof were eluted using same solvent with 2-5% MeOH, filtered through 
a layer of glass wool, evaporated and dried. Figure 9 shows a spectrum obtained in this way. 



It will be understood that the invention is described above by way of example only and modifications 
may be made whilst r emainin g within the scope and spirit of the invention. 
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Figure 1 

Trityl Tagging 




Unique mass-spectrum of the mass-tagged 
(bio)molecule 



AFG, activated functional group, i.e. NHS or maleimido 

FG, functional group, i.e. NH 2 or SH 

W, a bond between AFG and FG 

(Bio)molecule, a molecule to be analysed, i.e. a peptide 
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Figure 2 

Trityl Enhancer Tagging on Solid Support 




AFG, activated functional group, i.e. NHS or maleimido 

FG, functional group, i.e. NH 2 or SH 

W, a bond between AFG and FG 

(Bio)molecule, a molecule to be analysed, i.e. a peptide 
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Figure 3 

Reverse Trityl Enhancer Tagging on Solid Support 




Unique mass-spectrum of the mass-tagged 
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Figure 4 

Trityl Enhancer Tagging on Solid Support 2 
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MS detection of a cation 



Unique mass-spectrum of the mass-tagged 
(bio)molecule 



AFG, activated functional group, i.e. NHS or maleimido 

FG, functional group, i.e. NH 2 or SH 

W, a bond between AFG and FG 

(Bio)molecuie, a molecule to be analysed, i.e. a peptide 
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Figure 5 

Solid Support-Assisted Trityl Enhancer Tagging 
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Figure 6 
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Figure 8 

Figure 8A 



1001 



80 



CO 



60 



JE 40 
20 



■a 



650 



a: 
< 



LU 

LU 

> 



?1 



s en 



CL 



I 



SI 



1020 



1390 



a 

2 III 



1760 



Mass (m/z) 



s 

I 



2130 



2500 



to 
c 

0 



5 



sis 

s 



LU 

>- 



55« 



I 



Mass (m/z) 



cr 

> 



52 
§3 



CL .. 

a: 




PCT/GB2004/005140 




This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

/□ black borders 

'image cut off at top, bottom or sides 
3~faded text or drawing 
□'bljirred or illegible text or drawing 
o skewed/slanted images 

□ color or black and white photographs 

□ gray scale documents 

lines or marks on original document 

□ referenced) or exhibit(s) submitted are poor quality 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



